Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
๐ ANN Benchmarks
Recall@K, Query Latency, Index Build Time, Memory Usage
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
32440
posts in
15.5
ms
Lower
Latency
and Higher
Throughput
with Multi-node DeepSeek Deployment
research.perplexity.ai
ยท
10h
๐๏ธ
LLM Infrastructure
CACTUSDB
: Unlock Co-Optimization Opportunities for SQL and AI/ML
Inferences
arxiv.org
ยท
1d
๐ฏ
Qdrant
Google AI Introduces STATIC: A Sparse Matrix Framework
Delivering
948x Faster
Constrained
Decoding for LLM Based Generative Retrieval
marktechpost.com
ยท
1d
๐ค
Tokenization
Optimizing
Recommendation Systems with
JDK
โs Vector API
netflixtechblog.com
ยท
6h
ยท
Discuss:
Hacker News
,
r/programming
๐
SIMD Programming
Efficient frequent directions algorithms for approximate decomposition of
matrices
and higher-order
tensors
jmlr.org
ยท
19h
๐
Embeddings Optimization
Ditching
MongoDB Text Indexes for Edge
N-Grams
hjr265.me
ยท
1d
๐ฒ
GIN Indexes
and we have new
evidence
quarkus.io
ยท
6h
โก
Systems Performance
SemiAnalysisAI/InferenceX
: Open Source Continuous Inference Benchmarking Qwen3.5, DeepSeek, GPTOSS - GB200 NVL72 vs MI355X vs B200 vs GB300 NVL72 vs H100 & soonโข
TPUv6e/v7/Trainium2/3
github.com
ยท
1d
๐๏ธ
LLM Infrastructure
The AI
Efficiency
Survey
sambanova.ai
ยท
1d
๐
LLM Benchmarking
Architecting
and
Evaluating
an AI-First Search API
research.perplexity.ai
ยท
10h
๐ท๏ธ
Web Crawling
Optimal Heterogeneous Memory Configs for AI Tasks Under
Specified
Performance Metrics (Stanford,
UCSC
)
semiengineering.com
ยท
1d
๐ง
Memory Hierarchy Design
The
Architecture
Behind Open-Source LLMs
blog.bytebytego.com
ยท
15h
๐๏ธ
LLM Infrastructure
GPU-Native Approximate
Nearest
Neighbor Search with
IVF-RaBitQ
: Fast Index Build and Search
arxiv.org
ยท
1d
๐
Vector Indexing
Qwen 3.5 9B, 4B models beating
30B
,
80B
models
huggingface.co
ยท
12h
ยท
Discuss:
Hacker News
๐
LLM Benchmarking
๐ฅ Optimizing
nested
array operations in PHP: from O(
3n
) to O(n)
yellowduck.be
ยท
1d
๐น
Apache Arrow
Finite Neural Networks as
Mixtures
of Gaussian Processes: From
Provable
Error Bounds to Prior Selection
jmlr.org
ยท
19h
๐ง
LLM Inference
I built a
persistent
memory
layer
for AI agents in Rust
news.ycombinator.com
ยท
12h
ยท
Discuss:
Hacker News
๐
Tantivy
Rare Huawei-ByteDance alliance unveils
RRAM
AI chip delivering 66x CPU speed at
ISSCC
2026
digitimes.com
ยท
15h
๐ฅ๏ธ
Hardware Architecture
Monday AI
Radar
#15
lesswrong.com
ยท
2h
๐
New AI
Inception
Labs
says its diffusion LLM is 10x faster than Claude, ChatGPT, Gemini
thenewstack.io
ยท
10h
๐๏ธ
LLM Infrastructure
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help